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receiving from the user, data representative of one or more selected categorical labels; 



[and] 



labeling the document within the collection with the one or more selected categorical 
label s: and 

incrementally retrainingvthe classifier to adapt to modifications of the collection . 



12. The method of claim 1 1 fur her comprising the step of deriving a plurality of 
categorization shortcuts from the pi arality of most likely categorical labels, wherein the 
displaying step comprises the step c f displaying, to the user, the plurality of categorization 
shortcuts. y 



Amend claims 13-14 as follows: 



13. (amended) The method of claim 1 1 wherein the classifying step comprises the step of 
classifying, upon receipt into data storage, the document to obtain the plurality of most likely 
categorical labels. 



14. (amended) The method of claim 12 wherein the deriving step comprises the step of 
deriving, upon receipt of the document into data storage, categorization shortcuts from the 
plurality of most likely categorical labels. 



1 5 . The method of claim 1 2 wherein the 



deriving step comprises the step of labeling display 



buttons with the plurality of most likely cate prical labels, and the displaying step comprises the 



step of displaying the labeled display button 



with the document. 
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Amend claim 16 as follows: 



16. (amended) TMie method of claim 12 wherein the deriving step comprises the step of 
> creating an ordered sejf of the plurality of most likely categorical labels, and the displaying step 
comprises the step of displaying with the document the ordered set prepended to a standard 
[ordering] ordered set of other categorical labels. 



17. The method of claim 1 1 wherein th,e 
with the displaying step. 

1 8 . The method of claim 1 1 wherein th( 
upon invocation by the user, the document 
labels. 



classifying step occurs substantially simultaneously 



classifying step comprises the step of classifying, 
o obtain the plurality of most likely categorical 



19. The method of claim 1 8 wherein the invocation comprises a selection by the user of a 
classify button. 

20. The method of claim 1 1 wherein the labeling step comprises the step of storing the 
document in folders or locations of the collectio\i corresponding to the one or more selected 
categorical labels. 



21. The method of claim 1 1 further comprising the step of displaying a standard list of all 
categorical labels, wherein the receiving step comprises the step of receiving, from the user, data 
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representative of one or more selected categ< rical labels from either the plurality of displayed 
categorization shortcuts or the standard list. 



Cancel claim 22. 



Amend claims 23-24 as follows: 



23 . (amended) The m\thod of claim [22] JJ_ wherein the retraining step comprises the step of 
retraining the classifier in response to the labeling step. 




24. (amended) The methodW claim [22] jj. wherein the labeling step comprises the step of 
storing the document in folders oAlocations of the collection corresponding to the one or more 
selected categorical labels and the retraining step comprises the steps of: 

receiving, from the user, addition data representative of an addition of a document into a 
tofolder; and 

retraining the classifier in response to the addition data. 



25. The method of claim 24 wherein the retraiijiing step comprises the step of assigning, in the 
classifier, the added document to the tofolder. 




26. The method of claim 25 further comprising 
excluded from retraining and wherein the retraining 
classifier, the added document when the tofolder is ; 



e step of identifying excluded folders to be 
step comprises the step of assigning, in the 
ot one of the identified excluded folders. 
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Amend claims 27 as follows: 



27. (amended) The method of claim [22] U wherein the labeling step comprises the step of 
storing the document in folders or locations of the collection corresponding to the one or more 
selected categorical labels and me retraining step comprises the steps of: 

receiving, from the user, Deletion data representative of a removal of a document from a 

from folder; and 

retraining the classifier in response to the deletion data. 



!6 



28. The method of claim 27 wherein the retraining step comprises the step of unassigning, in 



6- 



the classifier, the removed document from the 



rom folder in which it was categorized. 



classifier, the removed document when the froi 
folders. 



29. The method of claim 28 further compris: ng the step of identifying excluded folders to be 
excluded from retraining and wherein the retraii ing step comprises the step of unassigning, in the 



folder is not one of the identified excluded 



Amend claims 30 as follows: 



1 30. (amended) The method of claim [E2] JT wherein the labeling step comprises the step of 
Storing the document in folders or location^ of the collection corresponding to the one or more 
selected categorical labels and the retraining step comprises the steps of: 

receiving, from the user, move data representative of a movement of a document from a 
source folder to a destination folder; and 

retraining the classifier in response to theYnove data. 
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3 1 . The method of claim 30 wherein the retraining step comprises the steps of: 

unassigning, in the classifier, the i loved document from the source folder in which it was 
categorized; and 

assigning, in the classifier, the movfed document to the destination folder. 



32. The method of claim 3 1 further comprising the step of identifying excluded folders to be 
excluded from retraining and wherein the retraining step comprises the steps of: 

unassigning, in the classifier, the moved document when the source folder is not one of 
the identified excluded folders; and 

assigning, in the classifier, the moved document when the destination folder is not one of 
the identified excluded folders. 



Amend claims 33-36 as follows: 



33. (amended) The method of claim [22] U wherein the retraining step occurs mstafrtty after 
a collection modification. 

34. (amended) The method of\;laim [22] U wherein the retraining step occurs a fixed 
amount of time after a last retrainingor an initial training from scratch. 



3 5 . (amended) The method of claim [221 JJ, wherein the retraining step occurs when a 
threshold number of documents have been added, deleted or moved in the collection. 
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36 (amended) The method of claim [22] JJ, wherein the re-training step occurs when an idle 



state is reachec 



37. 



The method of claim 20, wherein the cla sifying step comprises the steps of: 

tokenizing the document into different tokens; 

tallying a number of occurrences of each loken in the document; 



computing, for each folder, a token weigh 
comparing, for each token, the number of 



of each token; 

i iccurrences and the token weights; 



creating a similarity score in response to tb : comparing step; and 



identifying a subset of folders for which the 



38. The method of claim 37 further comprisin 
subset, all folders for which the similarity score is 



similarity score is highest. 



g the step of removing, from the identified 
lower than a default or specified threshold. 



39. The method of claim 37, wherein the competing step comprises the step of computing the 
token counts of each token in each of the folders. 



40. The method of claim 37 wherein the tokeniz 
separately tokenizing different portions of th 



ng step comprises the steps of: 
document; and 



labeling the tokens according to the different portions. 

4 1 . The method of claim 25 wherein the classifyinjg step comprises the steps of: 
tokenizing the document into different tokens; 
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tallying a number of occurrences of each token in the document; 

retrieving, for each folder, a tokebcount of each token; 

computing, for each folder, a tokan weight of each token; 

comparing, for each token, the number of occurrences and the token weights; 

creating a similarity score in response to the comparing step; and 

identifying a subset of folders for which the similarity score is highest, and 
wherein the assigning step comprises the ste|p of adding the number of occurrences of each token 
to the tokencount of the tofolder. 



42. The method of claim 28 wherein the classifying step comprises the steps of: 
tokenizing the document into different tokens; 
tallying a number of occurrences of each token in the document; 



retrieving, for each folder, a tokencount 
computing, for each folder, a token weij ; 
comparing, for each token, the number c 
creating a similarity score in response to 



of each token; 
it of each token; 

occurrences and the token weights; 
he comparing step; and 



identifying a subset of folders for which tie similarity score is highest, and 
wherein the unassigning step comprises the step c-f subtracting the number of occurrences of each 
token from the tokencount of the fromfolder. 



43 . The method of claim 3 1 wherein the classifying step comprises the steps of: 
tokenizing the document into different tokens; 
tallying a number of occurrences of each tokfen in the document; 
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retrieving, for each folder, a tokencount of each token; 

computing, for each folder, a token weight of each token; 

comparing, for each token, the nVimber of occurrences and the token weights; 

creating a similarity score in resppnse to the comparing step; and 

identifying a subset of folders for Which the similarity score is highest, and 
wherein the unassigning step comprises thte step of subtracting the number of occurrences of each 
token from the tokencount of the source folder, and the assigning step comprises the step of 
adding the number of occurrences of each tbken to the tokencount of the destination folder. 

44. The method of claim 1 1, further comprising the step of training the classifier from scratch 
with a pre-existing collection of categorized documents. 

45. The method of claim 44 wherein the labeling step comprises the step of storing the 
document in folders or locations of the collection corresponding to the one or more selected 
categorical labels and the training step comprised the step of assigning, in the classifier, each of 
the pre-existing documents to a folder in which it is categorized. 

46. The method of claim 45 wherein the classifying step comprises the steps of: 
tokenizing the document into different tokfens; 

tallying a number of occurrences of each token in the document; 

retrieving, for each folder, a tokencount of each token; 

computing, for each folder, a token weight of each token; 

comparing, for each token, the number of occurrences and the token weights; 
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creating a similarity score in response to the comparing step; and 
identifying a subset of folders for which the similarity score is highest, and 

wherein the assigning step comprises the step of adding the number of occurrences of each token 

to the tokencount of the tof older 



47. The method of claim 45 furthW comprising the step of identifying excluded folders to be 
excluded from training and wherein thfe training step comprises the step of assigning, in the 
classifier, each of the pre-existing documents, except those in the identified excluded folders. 

48. The method of claim 1 1 wherein th& labeling step comprises the step of storing the 
document in folders or locations of the collection corresponding to the one or more selected 
categorical labels and the re-training step comprises the steps of: 

determining a time of a last step of re-training; and 

retraining the classifier on each folder which was modified after the determined time. 



Amend claim 49 as follows: 



49. (amended) The method of claim [22] H wherfein the labeling step comprises the step of 
storing the document in folders or locations of the collection corresponding to the one or more 
selected categorical labels, the method further comprising the step of training the classifier from 
scratch with a pre-existing collection of categorized documents, wherein the re-training step 
comprises the steps of: 

determining a time of the step of training or a last stk> of re-training; and 
retraining the classifier on each folder which was modified after the determined time. 
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50. The method of claim 1 1 wherein the classifying step uses the TF-IDF principle. 



5 1 . The method of claim 1 1 wherein the electronic document is an e-mail message. 



52. The method of claim 1 1 wherein the electroijiic document is a web page and the collection 
is a collection of bookmarks. 

53. The method of claim 41 wherein the electronicidocument is a web page and the collection 
is a collection of bookmarks, the method further composing the step of storing, for each web 
page, a pagetokencount matching the tallied number of occurrences of each token. 



54. The method of claim 42 wherein the electronic document is a web page and the collection 
is a collection of bookmarks, the method further comprising the step of storing, for each web 
page, a pagetokencount matching the tallied number of occurrences of each token, wherein the 
unassigning step comprises the step of subtracting the pagetokencount from the tokencount of the 
fromfolder. 



55. The method of claim 43 wherein the electronic docuinent is a web page and the collection 
is a collection of bookmarks, the method further comprising the step of storing, for each web 
page, a pagetokencount matching the tallied number of occurrences of each token, wherein the 
unassigning step comprises the step of subtracting the pagetoKencount from the tokencount of the 
fromfolder. 
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56. The method of claim 1 1 wherein the Electronic document is a multimedia document 



57. The method of claim 56 wherein the nSultimedia document is an image file, a video file or 
an audio file. 

58. The method of claim 56 wherein the mi iltimedia document combines any combination of 
text, an image file, a video file and an audio fik . 



59. The method of claim 57 wherein the mu ltimedia document further includes text. 



60. The method of claim 1 1 wherein the ele 



viewable in their entirety, but can be categorize 




tronic document comprises data sets that are not 
1 in response to some presentation to the user. 

machine, tangibly embodying a program of 



61 . A program storage device, readable by a 

nstructions executable by the machine to perforin method steps for assisting a user with the task 



of categorizing an\lectronic document into a co 
11.--. 



lection according to the method steps of claim 
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